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INTRODUCTION 



Discrete time series arise in many different contexts. 

For example the exact arrival times in an arrival process are 
usually not measured. Instead the number of arrivals in suc- 
cessive time intervals are given. This is the case with the 
statistics published by the Center for Disease Control on the 
incidence of various diseases in the United States. The data 
are given as the number of occurrences of each disease in suc- 
cessive days. If the time intervals are short enough and the 
arrival process is orderly, then the resulting time series is 
approximately binary. In other instances the process that is 
being measured is continuous but the data is quantized in 
recording. For example, the amount of rainfall in a day 
(24 hours) at a location, given that some occurs, is a continu- 
ous random variable; however, it is often recorded to the 
nearest one-hundredth or one-tenth of an inch. Also, since a 
rainfall series will often contain many zeros (no rain), an 
analysis is often made of successive wet and dry days which is 
a binary time series [cf. Buishand, 1978]. An economic impera- 
tive for modelling and predicting the binary rainfall series is 
that it is the primary concomittant variable for predicting 
volume of business done in some establishments on successive days 
Markov chains have been used as models for stationary 
discrete time series. However, they are overparametrized for 
statistical purposes. Further, the data to be modelled can 
often be shown to be non-Markovian , or at least not first-order 
Markovian. Higher order Markov chains can be used but this only 
aggravates the problem of overparametr ization . 



In the past several years various parametrically simple 
models have been proposed for stationary discrete time series. 

The models have as parameters the fixed, first-order marginal 
distribution of the time series and the correlation structure. 

In Jacobs and Lewis [1978a, 1978b] a simple scheme is given for 
obtaining a stationary sequence of discrete random variables with 
a given marginal probability mass function tt and an autocor- 
relation structure like that of a mixed first-order autoregres- 
sive- (N+1) st-order moving average process. This DARMA(1, N+1) 
process has nonnegative correlations and a possibly countably 
infinite state space. The correlation structure is determined by 
parameters that are independent of the marginal distribution. 

A special case of the DARMA(1, N+1) process with mar- 
ginal probability mass function it is the DAR(l) process. 

This is a Markov chain with discrete state space IE and with 
transition matrix 

(1.1) P = pi + (l-p)Q , 

where Q is a matrix with = 7r(j) for i,j€IE ; ^ is the 

identity matrix with (i,j) element I^j and 0 _< p < 1 . The 
correlation structure of a real valued DAR(l) i.e. one for which 
IE is a subset of the real line, is that of a first-order auto- 
regressive process with kth-order serial correlation equal to 
p . There is no limitation on tt ; a common and useful assump- 
tion is that it be Poisson and therefore have an infinite state 
space. The DAR(l) model with a finite state space is a special 
case of the mover-stayer model [Bhat, 1972, p. 302-9]. 

Lindqvist [1978] studied a real valued finite state 
space Markov chain with a transition function of the form 
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( 1 . 2 ) 



P. . 
11 



= cl 



11 



+ (1-c) Q 
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where = '^ ( j ) , i' 
IE is finite, the constant 
values with the constraint 



j €IE as before. Since the state space 

c can take on some negative 
that 



(1.3) max [1- (l-7i(i))~^]£c£l , 

l<i<r 



where r is the number of elements in the state space. 

In Jacobs and Lewis [1978c], the DAR(l) process was 
extended to obtain a sequence of discrete random variables 
with pth order Markov dependence and given marginal distri- 
bution. The DAR(p) process is defined as follows. Let 
be a sequence of independent identically distributed random 
variables with = 1} = 1 - = 0} = 1 - p, 

0 £ p < 1; is a sequence of independent identically 

distributed random variables taking values {l,2,...p} with 
P{A^ = i} = , i = l,2,...,p; and is a sequence of 

independent identically distributed random variables with 
discrete state space IE and = i} = ir(i). Let 



(1.4) 



Z = V Z ^ 
n n n-A 



n 



+ (1-V )Y 
n n 



The process is called a DAR(p) process. Note that by 

direct argument from (1.4) 
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(1.5) 
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i-, , . . . ,Z =i } 

1' n-p+1 p-^ 



P{Z 



n+1 



Z = 
n 



P 

= (1-p) iT(j) + I pa,e.(i ) , 

k=l ^ D K 



where = 1 if i = j and £j(i) = 0 otherwise. There 
is no limitation on the marginal probability mass function tt . 

Pegram [1980] considers a real-valued finite state space 
model which is a generalization of the DAR(p) model in that 
its conditional probabilities are of the form 



( 1 . 6 ) 



P{Z 



n+1 




7 * 

• ' n-p+l 




= [1 



P 



I 

k=l 



Tl(j) 



P 

+ I <})].£■ (i.) . 

k=l ^ ^ ^ 



where k=l,...,p} are (possibly) negative constants. Note 

that although some of the constants (})j^ may be negative, the 
admissible values for depends on the marginal distribution 

IT. It was shown in Jacobs and Lewis, [1978c] that 
Corr(Z^, , k=l,2,,,. for the real valued DAR(p) process 

are nonnegative. Pegram 's model allows some of the correlations 
to be negative. The amount of negative correlation, as in 
Lindqvist's model, depends on the marginal distribution it. 

In this paper we will consider models for real-valued 
stationary discrete time series whose nonnegative correlation 
structure is that of a mixed pth-order autoregressive and qth-order 
moving average process. Thus we have a generalization of all 
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of the preceding models. In Section 2 we will give definitions 
of two such models, DARMA(p,N+l) (discrete mixed autoregressive- 
moving average process with orders p and N+1 respectively) 
and NDARMA(p, N). We briefly describe some of their properties 
and suggest an estimate for the correlations. In Section 3 
we describe in detail a simulation experiment that was done 
to study the behavior of various estimators for the first order 
serial correlation coefficients p of the DAR(l) model for 
small and moderate sample sizes. In Section 4 some extensions 
of the DARMA models are briefly discussed including one which 
can have negative correlations. Throughout the remainder of 
the paper we will assume that the NDARMA and DARMA processes 
are real valued. They can in fact be used to model categorical 
time series, but then numerical measures such as correlations 
are meaningless. 
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2. The DARMA(p,N+l) and NDARMA(p,N) Processes. 

In what follows we let be a sequence of independ- 

ent identically distributed random variables taking values in 
a real-valued discrete state space IE with P{Y^ = i} = •n’(i), 
ielE . Let and be independent sequences of independ- 

ent random variables taking the values 0 and 1 with 

(2.1) P{U^ = 1} = 3 and P{V^ = 1} = p 

for fixed 0 £ 2 £ 1 and 0 _< p < 1. Let be a sequence 

of independent identically distributed random variables taking 

values 0,1,2, ...,N with P{D, = n} = 6 , n = 0,1,. ..,N, and 

fc n 

{A^} be a sequence of independent identically distributed 
random variables taking values l,2,...,p with P{A, = n} = a , 
n — 

2.1 The DARMA(p,N+l) process. 

The DARMA(p,N+l) process is a sequence of random 
variables which is formed according to the probabilistic 

linear model 



( 2 . 2 ) 



X = UY + (1-U)Z ,,, 

n n ^ n-(N+l) 



for n = 1,2,..., where the "autoregressive tail" is 



(2.3) 



Z = VZ ^ +(1-V)Y 

n n n-A n n 

n 
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This process differs 



for n = -N-p+1, -N-p+2,.... 
from the DARMA(1,N+1) process defined in Jacobs and Lewis 
[1978a] in that the "autoregressive tail" is now the pth 

order autoregressive process of (1.4). In Jacobs and Lewis 
[1978c], it was shown that the vector-valued Markov chain 
{(Z^^, ^n-p+1^ ' ~ 1,2,...} has a limiting joint 

probability mass function v with marginal probability mass 
function tt . Hence, if ( , . . . , Z_^) has joint probability 
mass function v , then n = 1,2,...} is a stationary 

process with marginal probability mass function tt . 



Then {r(k)} can be shown to satisfy the following system of 
equations : 



Let r(k) = Corr(X^, the stationary process. 



(2.4) r(l) = 3^ I 



2 N-1 



+ 3(1-B)rg(l) + (1-3) ^r^(l) , 



i=0 



2 N-2 

(2.5) r(2) =3 I 

i=0 



‘^i‘^i+2 3(l-3)rg(2) + (l-3)^r^(2) 



(2.6) r(N) = 3^6 q 6^ + 3(l-3)rg(N) + (l-3)^r^(N) 



(2.7) 



r(N+k) = 3(1-3) rg(N+k) + ( 1- 3) ( N+k) . 



k > 1. 



In these equations 



( 2 . 8 ) 




k > 1 , 
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which satisfy the following Yule-Walker equations : 



(2.9) ^ paj^r^(O) + pa2r^d) + ... + papr^(p-l) , 

(2.10) ~ pa^^r^d) + pa2r^(0) + ... + papr^(p-2) , 

• • • • 

• • • • 

(2.11) ^A^P^ ^ + pa2r^(p-2) + ... +*papr^(0) , 

and for k ^ 1 , 

(2.12) r^(p+k) = paj^r^(p+k-l) + pa 2 r^(p+k- 2 ) + ... + pa^r^(k) , 

where ~ 

In addition 



N 



rg(i) = Corr(Z^^._,^^^, , 



is obtained recursively as 



rg ( 0 ) =0 ; 

Tg ( 1) = ( 1-p) ; 

rg(2) = paj^rg(l) + (l-p)6j^_j^ ; 

k-1 

ill d-p) '^^-(k-l) 

for k < min(p,N) ; 
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i-j. 




for max(p,N) > k ^ inin(p,N) ; 




i=l 



(k-i) 



for k > max(p,N) . 



To see that the serial correlations for the DARMA(p,inH-1) 



process are all nonnegative let q(i) (respectively q (i)) be 

the probability that and (respectively and 

Z ,.) choose the same random variable Y, , where, because of 
n+i k 

the backward definition of the autoregression k < n. Then 

q(i) (respectively q^(i)) also satisfy equations (2.4) - (2.7) 

(respectively (2.9) - (2.12)) and since they are nonnegative, 

so are the serial correlations. 

To see this identity, let R be the random index of 

the Y, , k < n, that X chooses; that is, 
k ' — ' n 



Then, since the random variables are independent of the 

random variables. 
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(2.13) 



fx X ^.1 = eTy^ Y_ 

L n n+£j 



= 1 = 



= ■'> 



n n+£ r 
I I E Y Y P{R 

k=l j=l L ^ ^ 






jT^k 



(2.14) 



= E 



P{R = R 

n n+£) 



Thus 



+ E 



'’“'n ^ W' 



=<n+«' = = 



'■'■’n = ■'n+d' 



+ E 









(2.15) 



= Var 



[’'l] ’’'■’n = 






Therefore 

(2.16) Corr(X^, = P(R^ = = q(t) 



as asserted above. 

This identity will also be used in the estimation 
procedure proposed in Section 3 for the serial correlations. 
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2.2 The NDARMA(p,N) process 

In this subsection we will define another related 
discrete time series with the correlation structure of a mixed 
moving average autoregressive process. This new process is 
more reminiscent of the linear ARMA(p,N) process. The key idea 
that leads to this new model is that a probabilistic mixture of 
a finite number of random variables each with probability mass 
function n has probability mass function tt even if the 
random variables are dependent. Thus it is not necessary to 
define the autoregression via an autoregressive tail, as in the 
DARMA(p,N+l) process; the autoregression can be made explicit, 
as in the usual (normal theory) linear processes. 

Thus let 



(2.17) 



X 



n 



= V X - 
n n-A 



n 



+ (1-V )Y ^ 
n n-D 



n 



where tv^}, and {D^} are as before. Thus, with prob- 



ability 



X 



n 



is one of the p previous values 



X .,..., X and with probability (1-p) it is a mixture of 

n-1 n-p 

the previous ' n - N ^ k _< n . Note that if p = 0, 

then {X^ ; n = 1 , 2 , . . . } is a DMA(N) process as defined in 
Jacobs and Lewis [1978a]. If P{D^ = 0} = 1, then {X^} is 
a DAR(p) process as defined in Jacobs and Lewis [1978c]. 



Let T = inf{i : 6^ > 0} . Note that 



Z = { (X 
•^n n 



X 



n-1 



,...,X ,, ,Y ,...,Y .J ,n=l,2,. 

' n-p+1 n-T n-N 



is a Markov Chain with state space IF which is equal to the 
product space of IE with itself p + (N-t) times. Since 
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f • • » f 




Y^} > (1-P)6^ TT(j) 



there is a set JciiF 




where 



K = P + N and 



^ ^ ^^n+K "^n+K-T' ^n+K-1 ’^n+K-l-x 



f * * • f 




Thus the condition of case (b) on page 173 of Doob [1953] is 
satisfied. The proof on pagesl73 and 174 extended to countable 
state spaces shows that has a limiting probability mass 

function v as n ^ ; further the convergence of the con- 
ditional distribution of to v as n ^ is geometric. 

The marginal probability mass function of v is ir . 

It follows from (2.17) that the serial correlations 
for the stationary NDARMA(p,N) process satisfy the Yule-VJalker 

equations for the ARMA(p,N) process with restrictions on the 
range of the coefficients; 



(2.19) 



r^(k) = Corr(X_^, 



P 






N 

+ (1-p) \ 6.Corr(X 

i=k ^ 



n 



Y 



n+k-i 



12 



for k > 0 . 



The correlations r,.^(i) = Corr(X , Y .) 

NB n n-i 

be computed recursively as follows. For i = 0 



(2.20) = (l-P)6o ' 

for 1 _< i < p 

" (1-P)6j^ + pa3_rj^g(i-l) + ... + pa.r^^(O) 



and for i > p 



^NB 



(i) 



(1-P)6^ + 



P 

D = 1 



/ 



where if i > N , then 6^ = 0 by convention. 



can 
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Hence, if we assume N < p 



(2.21) " paj^r^(k-l) + pa2r^(k-2) + ... + papr^(p-k) 

i=k 



for 1 < k < N ; 

(2.22) '*’ + ... + paprj^(k-p) 

for k > N . 

The serial correlations of the NDARMA(p,N) process are 

nonnegative since, if probability that and 

X , • choose the same random variable Y, , k < n , then 
n+i k — 

{q^^d)} satisfies equations (2.21) and (2.22). The argument 
is the same as for the DARMA(p,N+l) case. 



2.3 Comparison of Admissible Range of Correlations for the 
DARMA(1,1) and the NDARMA(1,1). 

Let be a stationary DARMA(1,1) process; that is. 



(2.23) 



X 



n 



Y 

n 



Z 



n-1 



with probability 3/ 
with probability 1-6, 
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where 



(2.24) 




with probability p, 
with probability 1-p. 



Equations (2.4) - (2.12) for the DARMA(p,N+l) correlations 
simplify to 



(2.25) r(k) = Corr(X^, = p*^ ^ (1-B) [ 6 (1-p) + (l-6)p] • 



Similarly let be a stationary NDARMA(1,1) process; 

that is 







n~l 


with probability 


P, 


(2.26) 


X ' = • 
n 


Y 

n 


with probability 


(1-P) <Sq, 






,Vi 


with probability 


(1-p) (1-6 q) . 


Equations 


(2.21) 


and ( 2 . 22) 


simplify to 




(2.27) 


r„(k) 


= Corr (X ' , 
n 


_,i \ k— Ip 

^n+k^ = P 


(1-P)^6q(1-6q: 



Figure 1 gives graphs of the attainable values of 
(rj^(2), rj^(l)} as the parameter values p and 6^ vary, and 
{r(2), r(l)} as the parameters p and B vary. Note that 
although the set of attainable correlations for the NDARMA(1,1) 
process is not strictly contained in that for the DARMA(1,1) 
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process, it is much smaller. Thus, the DARMA(1,1) model appears 
to be broader than the NDARMA(1,1) model. The smaller region 
of possible correlation pairs for the NDARMA(1,1) model seems to 
be a constraint due to the explicit autoregression on ^^-1 ' 



2.4 An Estimator for the Serial Correlations of the 
DARMA(p,N+l) and NDARMA(p,N) processes. 

The usual estimator for the serial correlations of a 
stationary real-valued sequence {X^,..., X^} is 



(2.28) 



?(Ji) 




m-il 

(m-£) I (X. 

j = l 



X) (X.^^ - X) , 



where 

(2.29) 



-] ™ 

X = m I X. 

j = l ^ 



and 

T ^ O 

<2-30) = Sri ,Z - X) • 

D=1 



In this subsection we will suggest another estimator 
for the serial correlations of the DARMA and NDARMA processes. 

By the remarks at the ends of Sections 2.1 and 2.2, 
the Jlth serial correlation, r(il), for both the stationary 
DARMA (p,N+l) and NDARMA(p,N) processes is equal to the prob- 
ability that X^ and choose the same , k < n . 

Hence, for both processes for i 7 ^ j , 
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P{X = i, X ,, = j} = P{Y = i, Y = j} 



= I I PfRn = "n+l = Pf^k = i- 



k<n r£n+£ 

r^k 



n+ £ 



Since the {Y, } random variables are independent, and independent 

K 

of R and R _^n , we have 

n n+£ 



(2.31) P{X_^ = i, = j) = Ml)lt(j)P{R„ 

= r(i)7i(j)[l - r(l)] , 



by equation (2.16). 

Thus , for j C IE 

_ 1 N-m 

(2.32) lim B (m,j) r lim (N-m) I I l.(X.) 1 . (X. ) 

N^oo N->oo i^j \^=i 1 X J K+m 

= [1-Ti ( j ) ] [1-r (m) ] iT(j) 



almost surely where l^ (x) = 1 if x = i and 0 otherwise. 
Hence 

(2.33) r(m) = 1 - I B (m,j)[l - ^(j)]"^ 

j €IE 

is a strongly consistent estimator for the mth serial correla- 
tion for the stationary DARMA and NDARMA processes. 
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Estimator r (m) is also a strongly consistent estimator for 
the finite state space models of Lindqvist (1978) and Pegram 
(1980) since the conditional probabilities (1.2) and (1-6) 
are of the same form as those for the appropriate DAR(p) 
process . 

In the next section we pursue this estimator for the 
special case of the first-order autoregressive process DAR(l). 
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3. ESTIMATION FOR THE DAR(l) PROCESS 
Let ^ stationary DAR(l) process with state 

space IE = {0,1,...} and first-order serial correlation p , 
0 < p < 1 ; that is. 



(3.1) 



X 



n 



'x 



n-1 




with probability p , 
with probability 1-p 



for n = 1,2,..., while X^ is a random variable independent 
of but with the same probability m.ass function. 

A simulation was conducted to study the performance of 
several estimator for p for small to moderate series lengths 
m . The series lengths considered were m = 20 , 50, and 200. 
The marginal probability mass functions considered were the 
Poisson with parameter A , 

-A A^ 

(3.2) TT(k) = e ^ k = 0,1, . . . , 

and the geometric with parameter p 

(3.3) 7t(k) = p^(l-p) k = 0,1,... . 



One type of estimator considered was the single param- 
eter maximum likelihood estimator. For a series of length m 
let denote the number of times the DAR(l) process goes 

from i to j , for i, jelE and let N^ denote the total 
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number of times the process is in state j . The log-likelihood 



function for a DAR(l) series of length m is 



(3.4) 



00 

L = I I N £n[(l-p)^(j) ] 
i=0 j?^i 



oo 

+ I N. . £n[l - (l-p)TT(i)] 
i=0 



+ y 1 . (X, ) TT ( i) . 

iio ^ ^ 



Taking the partial derivative of L with respect to x = 1-p 
and setting the derivative equal to zero results, after some 
simplification, in the following equation for the maximum likeli- 
hood estimator x = 1-p , if it exists: 



(3.5) f(x) = 1 - N ^ y N . . {1 - x[l - TT(i)]} ^ = 0 . 

i=0 



Note that f(x) is monotone decreasing in x and f(0) ^ 0 . 
Hence, if there is a solution to (3.5) in [0,1], it will be 
unique . 

The ad-hoc estimator of p given at (2.33) was also 
considered. For the DAR(l) process this estimator is 



(3.6) P = 1 - I 

j=0 



[N 



-1 



I N. . ][1 - ^( j) ] 

i=0 J 



-1 
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The summations in both the maximum, likelihood estimators 
and the estimators of the form (3.6) for the Poisson case 
(respectively the geomietric case) were restricted to be between 
a = max[l, n_(y-7a)] (respectively a = max(l, n_(y-10o)) 
and b = n_^(y+7o) (respectively n_|_ ( y+10 a) ) ; here n_(y) (respective 
n_|^(y)) is the largest (respectively smallest) integer less 
(respectively greater) than y . 

Equation (3.5) was solved numerically. In the case 
N = 20 , it was not uncommon that f(x) did not have a zero in 
[0,1]. In this case, if f(l) > 0 , then x was set equal to 

/N 

1 ; that is, p = 0 . If f(0) = 0 , then x was taken to be 
0 ; that is, p = 1 . 

Other estimators for p that were considered included 
the following: 

1. The usual estimate for first-order serial correlation 
9 -1 -1 

(3.7) p^ = [Sj^] (m-1) ^ I (X^ - X) (X^_|_^ - X) , 

n=l 

where X and are as in (2.29) and (2.30) . If = 0 , 

then Pj^ was set equal to 1 . 

2. The maximum likelihood equation (3.5) was solved numer- 
ically for p for each of the following three values for Tr(j) : 

a. the known distribution (3.2) or (3.3) with known 
parameter A or p was used; P 2 denotes this estimator; 

b. the known distribution with an estimated parameter 
was used; that is. 
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(3.8) 



ir (k) 



where 



= e 



-A (A) 



k! 



A = X 



or 

(3.9) Tr(k) = (1-p) where p = x[l + X]~^ ; 



denotes this estimator; 

— 1 

c. the nonparametric estimator ir ( j ) = N of 

Tr(j) was used; denotes this estimator. 

d. p^ is the estimator of p resulting from the two- 
dimensional maximum likelihood estimator where the other 
parameter is the distribution parameter (A or p) ; 

3. the estimate p^ is the nonparametric estimate 

(3.6) using N ^ as the estimate for tt ( j ) ; 

4. the estimate p^ is the nonparametric estimate 

(3.6) using the true value of ir ( j ) ; 

Both estimators p^ and p^ can have negative values 
for small to moderate sample sizes. Hence, we also considered the 
following estimate. 

5. Estimator p„ = max(p, , 0 ) . 

o o 



3.2 The sampling experiment. 

A DAR(l) series of length m was simulated and the 
estimates for p were computed. The computation was repeated 
for 1000 independent replications and the sample mean, sample 
variance, and sample root mean square error were computed. Each 
experiment was then repeated for 20 independent replications, 
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and the mean of the means, mean of the standard deviations, and 
mean of the root mean square errors were computed. Tables 1-3 
give the means of the root mean square errors for the cases 
studied. The box plots of row values appearing in the last 
column of the table are given to help the reader to summarize 
the performance of the 8 estimators across the 7 cases considered. 

All runs were performed on an IBM system 360/67 computer 
at the Naval Postgraduate School using the LLRANDOM package 
[Learmonth and Lewis, 1973] which generates numbers according 
to the scheme given by Lewis, Goodman, and Miller [1969]. Tests 
of the random number generator are given in Learmonth and Lewis 
[1974 ] . 

Among all the estimates of p , the usual first-order 
serial correlation estimator, , performs least well. The 
maximum likelihood estimator with smallest root mean square 
error tends to be P 2 , although by the time m = 200 the 
difference is minor. Of course the value of p or A would 
not be known in general, so that this estimator is unrealistic. 
Maximum likelihood estimators p^ and p^ are about equiva- 
lent, indicating that the extra computational complexity of 
the two-parameter maximum likelihood estimator, p^ , is not 
necessary. The performance of the nonparametric estimator Pg 
is about the same or sometimes better than that of the maximum 
likelihood estimator p^ , especially if, for small sample size 
(m=20) , the modification pg = max(pg,0) is used. This sug- 
gests that the ad-hoc estimator of (3.6) altered to give values 
in the range [0,1] is almost as good as the maximum likelihood 
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estimator with the same estimate for rr ( j ) . The ad-hoc 

estimator is much easier to compute than the maximum likelihood 
estimator . 
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Table 1. Estimated Root Mean Square Error for several 

competing estimators of p for DAR(l) Series of length 




Table 3. Estimated Root Mean Square Error for several 
Estimators of p for DAR(l) Series of Length 

m = 200 
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4. 



EXTENSIONS 



Since the process obtained as a probabilistic 

mixture of the DARTIA and NDARMA process may be defined 

using any sequence of independent identically distributed random 

variables {Y } . One implication is that DARMA and NDARMA 

n 

processes may have a continuous marginal distribution. However, 

even if the distribution of Y is continuous, a realization 

n 

of the sequence will, in general, contain many runs of a 

single value. This seems to be the major drawback to using 
DARMA and NDAIMA processes to obtain a sequence of dependent 
random variables with a specified continuous distribution and 
correlation structure. However, the process with continuous 
marginals may be useful in simulation studies. 

Multivariate DARMA and NDARMA processes may be obtained 
by using a sequence of multivariate illustrate this 

we generate DARMA and NDARMA-like processes having negative 
correlations. These can be derived from bivariate processes 
as follows. 

Let { (Y (1), Y (-1))} be a sequence of independent 
n n 

bivariate random variables with state space IE = {0,+l»*-*} / 

marginal probability mass function n , and correlation 
r = Corr(Y^d), Y^(-l)) which will be negative in general. One 
way to generate such a sequence is to note that a random vari- 
able Y^(l) with probability mass function ir can be simulated 
from a uniform [0,1] random variable by defining 

j-1 0 

(4.1) Y (1) = j if I Tr(i) < U ^ I Tr(i) . 

^ i=_00 izr-oo 
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If Y^(-l) is generated by 

j-1 3 

(4.2) Y (-1) = j if I 7T(i) < 1 - U < I TT(i) , 

“ T=-00 i_-oo 



then (Y (1) , Y (-1) ) is called an antithetic pair. If tt is 

symmetric about zero, then Y^(l) = - Y^(-l) , and 

r = Corr(Y (1) , Y (-1) ) = -1 . 
n n 

A bivariate DARMA (p,N+l) process {(X^(l), X^(-l))} 
is defined as follows. Let {aQ,...,a^} and {bQ,...,bj^} be 
fixed sequences of numbers that are either -1 or 1 . Let 



(4.3) 



X (1) = U Y ,, 
n n n-B 



) 



n 



D 



n 



+ '"-V"n-(N+1) <®0> 



(4.4) x„(-i) = (-bp ) * a-%)V(N-n) <-^o> 

n n 



for n = 1 , 2 , . . . , 



where 



(4.5) 






V Z „ 
n n-A 



(a,. ) + 



n 



A 



n 






(4.6) Zp(-a„) = VpZ (-a ) 

n n 



6 (l-vpYp(-ap 



for n — -N, -N+1, . . . where } and {D } are as in Section 

n n 

2. The random variable X^(-l) is called the dual of X (1) 

n n 
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or the antithetic when (4.1) and (4.2) hold for the {Y } pair 

Note that if (4.1) and (4.2) hold and tt is syinrnetric about 

zero, then Z„(-D = - Z^(l) and X (-1) = - X (1) . 

n n n n 

t I 

A bivariate NDARMA(p,q) process (X^(l), X^(-l) ) can 
be defined similarly: 



(4.7) 



X (1) = V X’ ^ 
n n n-A 



. <^A > '^n-D "=D > 

n n n n 



(4.8) X„(-l) = (-a^ ) + (l-V^)V^.D (-bo > 

n n n n 



The stationary bivariate DARRA and NDARMA processes 
will have marginal probability mass function tt . A process 
having possibly negative correlations can be obtained by consider 
the marginal processes {X^(l)}, {X^(-l)}, {X^(l)}, {X^(-l)} . 

Details will be given elsewhere. 
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